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1.  Introduction 

Suppose  we  are  interested  in  the  density  /  „  of  some  statistic  Tn  (x  t.  •  •  •  ,  x„  )  where 
x  •  •  •  ,  x„  are  n  independent  identically  distributed  observations  with  the  underlying 
density  /  .  If.  after  suitable  standardization.  7",,  is  asymptotically  normally  distributed,  one 
can  usually  improve  the  approximation  of  J  „  given  by  the  asymptotic  distribution  by  using 
the  first  few  terms  of  an  Edgeworth  expansion  (cf.  Feller  1971.  Ch.  16).  This  is  an  expansion 
in  powers  of  n  "V:,  where  the  constant  term  is  the  normal  density.  It  turns  out  in  general 
that  the  Edgeworth  expansion  provides  a  good  approximation  in  the  center  of  the  density,  but 
can  be  inaccurate  in  the  tails  where  it  can  even  become  negative.  Thus  the  Edgeworth  expan¬ 
sion  can  be  unreliable  for  calculating  tail  probabilities  (the  values  usually  of  interest)  when 
the  sample  size  is  small.  Saddlepoint  and  small  sample  asymptotic  techniques  overcome  this 
problem. 

Saddlepoint  techniques  were  used  by  H.  E  Daniels  in  a  pioneering  paper  in  1954  to 
derive  an  approximation  to  /  „  where  T„  is  the  mean  of  x  j,  •  •  •  ,  x„  .  The  key  idea  is  as 
follows.  The  density  /  „  can  be  written  as  an  integral  on  the  complex  plane  by  means  of  a 
Fourier  transform.  Since  the  integrand  is  of  the  form  exp  (n  w  (z  )),  the  major  contribution 
to  this  integral  for  large  n  will  come  from  a  neighborhood  of  the  saddlepoint  z  q,  a  zero  of 
w’(z  ).  By  means  of  the  method  of  steepest  descent,  one  can  then  derive  a  complete  expansion 
for  f  „  with  terms  in  powers  of  n~l.  Daniels  (1954)  also  showed  that  this  expansion  is 
equivalent  to  that  obtained  using  the  idea  of  the  conjugate  density  (see  Cramer,  1938;  Khin- 
chin.  1949).  The  key  point  can  be  summarized  as  follows.  First  recenter  the  original  under¬ 
lying  distribution  /  at  the  point  t0  where  f  „  has  to  be  approximated,  that  is,  to  /  define 
its  conjugate  (or  associate)  density  h,  .  Then  use  the  Edgeworth  expansion  locally  at  t  q  with 
respect  to  ht  and  transform  the  results  back  in  terms  of  the  original  density  /  .  Since  t  o  is 
the  mean  of  the  conjugate  density  ht  ^  the  Edgeworth  expansion  at  t  o  with  respect  to  ht  is  in 
fact  of  order  n  and  provides  a  good  approximation  locally  at  that  point. 


'  •  •  i * 
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Small  sample  asymptotic  techniques  which  are  closely  related  to  saddlepoint  techniques 
were  introduced  by  Hampel  (1973)  and  are  based  on  the  idea  of  recentering  the  original  distri¬ 
bution.  The  main  difference  is  the  expansion  of  the  log-derivative  /  „  7  f  „  instead  of  /  „  . 
A  consequence  of  this  is  that  the  normalizing  constant,  that  is  the  constant  that  makes  the 
total  mass  equal  1.  must  be  determined  numericallv.  Often  this  turns  out  to  be  an  advantage 
since  this  rescaling  mav  improve  the  approximation.  The  unusual  characteristic  of  these 
approximations  is  that  the  first  few  terms  lor  even  just  the  leading  term)  of  the  expansion 
often  give  a  very  accurate  approximation  in  small  sample  sizes. 

In  the  last  few  years  there  has  been  a  revival  of  interest  in  this  area.  Small  sample 
asymptotic  approximations  are  now  available  for  M-estimates  a  location  (Field  and  Hampel, 
1982;  Daniels.  1983)  and  for  general  multivariate  M -estimators  (Field,  1982).  Durbin  (1980) 
applies  similar  techniques  to  derive  approximations  of  the  density  of  sufficient  estimators. 
Field  and  Ronchetti  (1983)  derive  small  sample  asymptotic  approximations  to  the  tail  area  of 
M-statistics  and  use  them  in  robust  testing.  We  refer  to  the  papers  by  Bamdorff-Nielson  and 
Cox  (1979)  and  Field  and  Hampel  (1982)  for  an  overview  and  comparison  between  these  new 
techniques  and  the  classical  ones. 

The  main  goal  of  this  paper  is  to  show  that  saddlepoint  techniques  can  be  carried  out  for 
general  statistics,  including  for  example  linear  combinations  of  order  statistics.  In  section  2  we 
present  the  basic  idea.  We  show  that  whenever  an  Edgeworth  expansion  for  the  density  /  „ 
of  T„  is  available,  a  saddlepoint  approximation  can  be  carried  out  and  will  in  general  improve 
it.  In  fact,  from  the  Edgeworth  expansion  we  obtain  an  approximation  for  the  cumulant  gen¬ 
erating  function  of  7’„  .  Using  standard  saddlepoint  techniques  we  then  work  out  an  expan¬ 
sion  for  /  n  in  powers  of  n  In  the  section  3,  we  apply  this  result  to  L-statistics  and  in 
sections  4  and  5  we  present  numerical  results  for  the  most  efficient  L-estimator  under  the 
logistic  distribution  and  for  trimmed  means  of  exponential  observations.  Exact  results  are 
compared  with  asymptotic  normal,  Edgeworth,  and  saddlepoint  approximations.  This  shows 
the  accuracy  of  the  latter  in  very  small  sample  sizes.  Finally,  in  section  6  we  outline  some 
further  research  directions,  including  the  relationship  with  bootstrap  techniques. 
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2.  General  Saddlepoint  Approximations 


Let  x  j,  •  •  •  ,  xr  be  n  independent  identically  distributed  real  valued  random  variables 
with  density  /  and  let  Tn(x  i,  •  •  •  ,  x„  )  be  a  real  valued  statistic.  Denote  by  /  „  the  den¬ 
sity  of  Tn  .  by  M„(t  )=/ etx  f  „  ( x  )dx  the  moment  generating  function,  by 
K„  (r  )  =  log  M„  (/  )  the  cumulant  generating  function,  and  by  p„(i  1  =  Mn  (it  ) the  charac¬ 
teristic  function. 


Bv  Fourier  inversion 


oo 

/  »  (x  )  *  7-/  M„  (it  )e  -*x  dt 

— O© 

l  oo 

-  /  M„(nT)e  ~nTx  dT 


-  n  f 

2mrlj 


n  (7  )-7z  > . 


where  t  is  any  real  number  in  an  interval  containing  the  origin  where  the  moment  generat¬ 


ing  function  exists,  and 


R„  (T  )  =  K„  ( nT  )/  n. 


Note  that  if  T„  is  the  arithmetic  mean.  R„  (T  )  =  K  (T  ),  the  cumulant  generating  function 
of  the  underlying  density  /  and  in  this  case  (2.1)  equals  formula  (2.2)  in  Daniels  (1954). 
The  idea  is  to  approximate  R„  ( T  )  and  then  apply  the  saddlepoint  technique  to  the  integral  in 
(2.1)  along  a  suitable  choice  of  the  path  of  integration. 

Suppose  an  Edgeworth  expansion  for  /  „  is  available  and  denote  by  /  „  the  Edgeworth 
approximation  up  to  and  including  the  term  of  order  n  -l.  Let  M„  and  Kn  be  the  moment 
generating  function  and  the  cumulant  generating  function  of  /  „  respectively,  and  let 
~R„(T  )  -  K„  (nT  )/  n  .  Then  bv  (2.1)  we  have 


T 

i 
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/„  (x  )  =  Z„  (x  )+/„  (x  ) 


T+i  OO 


n(R„  IT  >-7z  ) 


=  Zn(x)+~r  f  e . dT 

T«,  0^ 


(2.3) 

(2.4- 


where  Zn  (x  )  =  /  „  (x  )—  f  n  (x  ).  Nov.  the  classical  Edgeworth  expansion  provides  an 
expansion  for  Rn  ( 7  )  (see  below)  and  this  choice  uniformly  bounds  the  error  term  Zn  . 
Applying  the  saddlepoint  technique  to  the  integral  in  (2.4)  improves  the  Edgeworth  approxi¬ 
mation  by  eliminating  the  term  of  order  n  More  precisely,  f  rom  the  Edgeworth  approxi¬ 
mation  (up  to  the  term  of  order  n~l)  one  can  obtain  log  p„  ( t  )  and  therefore  R„  (T  )  in  terms 
of  the  cumulants.  That  is. 


_  .  nrfT2  .  *3n  cr*n  2T  3  K4ncrn4n  Y  4 

Rn  (T  )  =  fi„  T  +  - - -  +  - - -  +  - - - 


(2.5) 


where  n„  is  the  mean  and  <r„-  the  variance  of  Tn  .  Note  that  n„  ~  O  (1).  <rn  =  O  in  * ), 
and  k  j„  =  0  (n  ~7/  2+l)  for  j  -  3,4,  since  we  have  assumed  that  the  Edgeworth  expansion 
for  /  „  exists.  In  general,  n„  and  <r„  are  not  known  exactly  but  expansions  of  the  form 


Mn  -  M  +  ~  +  o  (n  “*), 
n 


<r„  =  ~  +  — ~  +  o  (n  3/  2). 
n  *  n  '  ^ 


will  suffice  to  keep  the  same  order  in  the  approximation. 

Applying  the  saddlepoint  technique  (cf.  Daniels,  1954)  to  the  integral  in  (2.4)  gives  the 
saddlepoint  approximation  of  f  „  with  error  of  order  n  _l: 


gn  )  = 


2 nR  (T0  ) 


I '/: 


n  {Rn  IT,,  >  —  7„  x 


(2.6) 


where  T0  is  the  saddlepoint  determined  as  a  solution  to  the  equation 

R‘„(T0)  =  x 

and  R  and  R  denote  the  first  two  derivatives  of  R„  . 


Jk- 

■%: 


% 


r-  —  4 

* 


•l 


£*1 

•  , 
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Remark  1;  Another  approximation  for  R„  (T  )  can  be  obtained  using  p„  ( l  )  given  by  the  Edge- 
worth  approximation  instead  of  the  expansion  of  log  p„  U  ).  This  amounts  to  approximating 
Rn  (T  )  by 


Rn(T)  =  nnT  +  —  +  -log 

l  n 


1+ 


n  in  3*  4„  *  W’f  A+Kfnn  b*?T  b 
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(2.7) 


Since  the  saddlepomt  approximation  based  on  (2.7)  gives  a  poorer  approximation  in  the  exam¬ 
ples  considered  than  that  based  on  (2.5)  we  will  not  include  this  approach  in  the  numerical 
examples  presented  in  sections  4  and  5. 


Remark  2;  By  the  same  computations  as  in  Daniels  ( 1 954 )  one  can  express  /  „  by  means  of  its 
conjugate  density,  namely 

/„(i)  =  /l,*'TK?'  h„  ,T(0).  (2.8) 

where  hn  tT  is  the  conjugate  density  proportional  to  e  n  Th  f  „  (u  +  x  ).  The  choice  t-T  „  and 
an  Edgeworth  expansion  of  h„  T(0)  leads  to  the  saddiepoint  approximation  (2.6).  Note  that  the 
term  of  order  n  disappears  because  /  „  is  recentered  at  x  through  h„  >T,  i^. 

Ekit  V  -  f  uh„  ,r(u  )du  -  Rn  (r)-x  =  0 

if  r  —T0,  the  saddiepoint.  However,  one  can  think  of  other  ways  of  recentering  /  „  .  For 
instance,  one  can  use  the  median  instead  of  expectation  and  solve  the  equation  (for  T  j) 

o 

f  hnj  (,u)du  =  1  /  2  . 


Remark 3:  An  alternative  way  of  approximating  the  density  of  a  general  statistic  by  means  of 
saddiepoint  techniques  is  the  following  (see  Field,  1982).  Suppose  T„  can  be  written  as  a 
functional  T  of  the  empirical  distribution  function  Fin  [  i.e.  T„  =  T  {F ln  *).  First  linearize 
T„  using  the  first  term  of  a  von  Mises  expansion  (cf.  von  Mises,  1947) 

T„  =  T  (F  )  +  L„(T  J  ),  (2.9) 

where  F  is  the  underlying  distribution  of  the  observations. 
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Ln(T  ,V  )  =  n~'  £  /}■  (x:  :7  .7  ).  (2.101 

;  =  I 

and  IF  (x  J  J )  is  the  influence  f  unction  of  7  at  )■  Icf.  Hampel.  1968.  1974).  Now  applv 
the  classical  saddlepoint  approximation  to  L„  (7  ,V  )  which  is  an  average  of  independent 
identically  distributed  random  variables.  In  preliminary  numerical  results  this  approximation 
does  not  perform  as  well  as  the  one  given  by  (2.6).  Moreover,  the  order  of  the  approximation 
based  on  (2.9)  is  an  open  question. 

3.  Applications  to  L-statistics 


In  this  section  we  apply  our  general  saddlepoint  technique  to  derive  approximations  to 
the  density  of  linear  combinations  of  order  statistics. 

We  consider  statistics  of  the  form 


Tn  ~  n  *  Z  Cyt  x  (,  ,  (3.1 ) 

t  *l 

where  x<i)^X(2)^  ^xln  i  are  the  order  statistics  and  c  i„  .  ■  ■  ■  ,cnn  are  weights  gen¬ 
erated  by  a  function  J  :  (0.1  )—R  . 


Typically  the  conditions  imposed  on  J  are  those  that  guarantee  the  existence  of  an  Edgeworth 
expansion  (see  below). 

The  distribution  properties  of  l.-statistics  have  been  investigated  by  many  authors.  Exact 
distributions  under  special  underlying  distributions  can  be  found  in  WeisbeTg  (1971),  Cicchi- 
telli  (1976),  and  David  (1981).  Asymptotic  normality  of  these  statistics  has  been  shown  under 
different  sets  of  conditions:  see,  for  instance,  Chernoff,  (iastwirth,  and  Johns  (1967),  Shorack 
(1972),  Stigler  (1974),  David  (1981).  Finally,  Helmers  (1979,1980)  and  van  Zwet  (1979) 
derived  Edgeworth  expansions  for  L-statistics  with  remainder  o  (n  -l).  These  will  be  the  basic 
elements  of  our  approximation  for  we  use  them  in  conjunction  with  the  saddlepoint  tech¬ 
nique. 


* 
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To  calculate  the  saddlepoint  approximation  to  f  n  at  x  we  need  to  evaluate  the  func¬ 
tion  R„  ( t  )  and  its  first  two  derivatives  at  the  point  T  0  where  T  n  is  such  that 

h  „  (7  =  x  (3.2) 

and  R„  ( l  )  is  given  by  equation  (2.5). 

We  would  like  to  be  able  to  plot  and  study  the  saddlepoint  approximation  of  the  entire 
density,  so  we  typically  calculate  the  saddlepoint  approximation  at  about  five  hundred  x  coor¬ 
dinates.  Thus,  the  procedure  has  been  implemented  as  a  single  precision  Fortran  program 
(within  the  S  statistical  environment).  The  implicit  equation  (3.2)  is  solved  lor  each  x  coor¬ 
dinate  using  a  Newton  method.  The  implemented  strategy  is  to  order  the  x  values  and  choose 
a  starting  point  which  gives  convergence  to  the  required  accuracy  for  the  first  x  value.  For 
each  subsequent  point,  the  initial  solution  for  the  Newton  method  is  chosen  to  be  the  solution 
for  the  previous  x  coordinate.  Since  the  solutions  do  not  change  much  for  small  changes  in 
the  x  coordinate,  convergence  for  subsequent  points  is  rapid  and  the  method  is  fast  and 
effective. 

Saddlepoint  approximations  for  distributions  or  tail  areas  are  obtained  from  the 
corresponding  density  approximations  bv  integrating  using  Simpson’s  rule.  A  high  degree  of 
accuracy  is  obtained  due  to  the  large  number  (501)  points  at  which  the  density  is  evaluated. 

4.  Example  1:  The  Most  Efficient  L-Estimator  Under  the  Logistic  Distribution 


As  a  first  example  of  the  technique  described  above  we  consider  the  asymptotically  first 
order  efficient  L-estimator  for  the  center  9  of  the  logistic  distribution 

1 


/•'  (x  -9 )  = 


1+e 


-(x  -0  )  ' 


This  L-estimator  has  the  weight  function 

J  (s  )  ~  bs  ( 1  — a  ). 


Thus  we  are  approximating  the  statistic  Tn  of  the  form  (3.1)  where, 
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9 


=  6 


i 

n  +  1 


1- 


t 

n  +1 


Helmers  (1980*  derives  the  Edgeworth  expansion  lor  the  distribution  of 
TV  =  (T„  —  fj.„  )'  <7„  .  and  for  the  distribution  of  the  related  statistic  T r**  -  n  ‘/2iTn  —fj.)/  cr. 
where  /jl  and  cr  are  the  asvmptotic  mean  and  variance.  The  Edgeworth  expansion  for  the  den¬ 
sity  of  T  *n  is  given  bv 


/%(*)■  d*x  ) 


1  +  — 7— (jc  3-3jc  4-bx  2+3 )+^-zP-(x  b-\Sx  a+45x  2-15) 


*4r. 


-> 

K‘ln 


24 
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,(4.1) 


where,  )  is  the  standard  normal  density,  kj„  =  0,  and  K4n  =  24/  (20n  ).  This  approxi¬ 
mation  forms  the  basis  for  the  saddlepoint  approximation.  It  should  be  noted  that  in  this  case 
Ki„  *  0  so  the  term  of  order  n  disappears  in  the  Edgeworth  expansion.  Thus,  this  expan¬ 
sion  is  of  order  n  ~ 1  and  should  be  very  competitive  with  the  saddlepoint  approximation. 

The  Edgeworth  expansion  for  T„ **  is  derived  from  (4.1)  by  using  the  approximations 

=  /i+0(l/n2)  (4.2) 

and. 

<rn  -  V 3/  n  /l+(  1 1  - tt2)'/  3/  n  3/  2+0  (1/  n  2).  (4.3) 

Since  the  exact  mean  and  variance  for  the  statistic  T„  are  not  available,  the  equations  (4.2) 
and  (4.3)  can  be  used  in  the  saddlepoint  method  without  changing  the  order  of  the  approxima¬ 
tions. 


Numerical  results  for  the  distribution  of  the  statistic  T„ **  for  sample  sizes  3.  4.  10,  and 
25  are  given  in  tables  4.1-4.4  for  the  right  half  of  the  distribution  since  the  density  is  sym¬ 
metric.  The  exact  values  for  the  distribution  are  taken  from  Helmers  (1980).  These  exact 
values  were  calculated  by  numerical  integration  for  sample  sizes  3  and  4,  and  by  Monte-Carlo 
simulation  using  25,000  samples  for  sample  sizes  10  and  25.  The  rescaled  saddlepoint  approxi¬ 
mation  was  calculated  by  integrating  the  saddlepoint  approximation  for  the  density  and  then 
rescaling  so  that  the  density  integrates  to  one.  This  technique  was  recommended  by  Hampel 
(1973)  as  a  method  for  improving  the  accuracy  of  saddlepoint  approximations  in  small  sample 
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sizes.  The  unsealed  saddlepoint  approximation  for  the  tail  area  was  calculated  by  integrating 
the  right  tail  of  the  saddlepoint  approximation  for  the  density  and  subtracting  from  1.  The 
values  lor  the  Edgeworth  approximation  were  recalculated  from  the  formula  given  in  Hel 
mers  (1980). 

Figure  4.1  plots  the  exact  distribution,  rescaled  saddlepoint,  Edgeworth,  and  normal 
approximations  for  sample  size  three.  This  plot  shows  that  both  the  rescaled  saddlepoint  and 
Edgeworth  approximations  are.  in  this  example,  superior  to  the  normal  approximation.  Figure 
4.2  shows  the  error  from  the  rescaled  saddlepoint  approximation  for  the  exact.  Edgeworth,  and 
normal  approximations.  It  is  clear  from  this  plot  that  there  is  some  sort  of  (probably  numeri¬ 
cal)  error  m  the  circled  values  of  the  exact  distribution.  Figure  4.3  shows  the  residuals  from 
the  exact  distribution  (with  the  value  in  error  eliminated)  for  the  rescaled  saddlepoint.  Edge- 
worth.  and  normal  approximations.  This  plot  clearly  indicates  that  the  rescaled  saddlepoint 
approximation  overall  improves  the  Edgeworth  approximation.  Also,  unlike  the  Edgeworth 
and  normal  approximations,  the  rescaled  saddlepoint  approximation  is  wider  tailed  than  the 
exact  distribution,  so  its  error  is  in  the  direction  of  giving  conservative  tests  and  confidence 
intervals. 

Figure  4.4  shows  the  residual  from  the  exact  distribution  for  sample  size  4.  The  overall 
impression  is  essentially  the  same  as  for  sample  size  3. 

Figure  4.5  shows  the  residual  from  the  rescaled  saddlepoint  approximation  for  the 
"exact",  normal,  and  rescaled  saddlepoint  distributions  for  sample  size  10.  In  this  case  the 
"exact"  values  were  determined  by  Monte-Carlo  simulation,  and  Figure  4.5  shows  that  the 
variation  in  the  "exact"  values  is  substantial  in  comparison  to  the  error  in  the  rescaled 
saddlepoint  and  Edgeworth  approximations,  so  it  is  more  difficult  to  be  certain  which  approxi¬ 
mation  is  better.  Figure  4.6  shows  the  plot  of  the  error  from  the  "exact"  distribution  for  the 
rescaled  saddlepoint  and  Edgeworth  approximations.  While  the  curves  are  no  longer  smooth 
due  to  the  noise  in  the  "exact"  values,  the  impression  is  much  as  before.  Finally,  for  sample 
size  25  both  the  saddlepoint  and  the  Edgeworth  approximation  are  nearly  indistinguishable 
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Overall  it  appears  that  both  the  rescaled  saddlepoint  and  Edgeworth  approximations  give 
very  good  approximations  to  the  exact  distribution  of  this  statistic.  The  rescaled  saddlepoint 
technique  generally  improves  the  Edgeworth  approximation,  and  tends  to  err  in  the  direction 
which  produces  conservative  tests  and  confidence  intervals. 

While  we  are  not  approximating  the  distribution  function  directly,  in  practice  these 
approximations  mav  be  used  for  calculating  tail  areas.  Thus,  it  is  of  interest  to  see  how  the 
unsealed  saddlepoint  approximation  for  the  distribution  performs  in  the  tails.  Figure  4.7 
shows  the  right  tail  area  for  the  right  half  of  the  distribution  lor  the  exact,  unsealed 
saddlepoint.  rescaled  saddlepoint.  Edgeworth,  and  normal  distributions.  From  this  plot  it  is 
clear  that  the  unsealed  saddlepoint  approximation  is  actually  a  better  approximation  in  the 
tails  of  the  distribution.  Figure  4.8  and  4.9  show  similar  plots  for  sample  sizes  4  and  10.  and 
these  plots  show  essentially  the  same  behavior. 


S.  Example  2:  Trimmed  Means  of  Exponential  Observations 


This  example  considers  approximations  to  the  distribution  of  trimmed  means  of  exponen¬ 
tial  observations.  Let  a/  and  au  be  the  fraction  of  the  observations  trimmed  from  the  upper 
and  lower  tails  respectively.  Thus  we  consider  statistics  of  the  f  orm  (3.1)  where 

0  fori  a;  or  i  >n(l  —  au  , 

Ctn  ~  n  /  k  otherwise. 


where  k  is  the  number  of  non-zero  weights.  Note  that  n  1  £  cin  —  1. 

i=i 

Helmers  (1979)  derives  the  Edgeworth  expansion  for  the  distribution  of 
T  *n  =  (Tn  —fjLn  )/  <r„  ,  for  trimmed  linear  combinations  of  order  statistics  with  general 
weights  on  the  observations  between  the  a;  and  1—  au  sample  quantiles,  and  zero  weight  on 
the  remaining  observations.  This  expansion  forms  the  basis  for  the  saddlepoint  approximation. 
Jn  the  case  of  linear  combinations  of  exponential  order  statistics  the  Edgeworth  expansion  for 


r.  .• 


Jjfk 
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the  density  of  Tn*  is  given  by  (4.1 )  with 


and 


where 


=  (2  I  a3m  )/  (  Z  a  l  )'v  2 
.'  =  i  =1 


*4n  =  (h  £  a  m  )•'  1  £  a  m 
’  =  I  /  = ) 


^  m  ~~ 


m  n  -  ;  +1  ,e  ,n ' 


£  C/J 


The  exact  densitv  lor  certain  linear  combinations  of  exponential  order  statistics  is  given 
in  David  (1981 ).  The  density  of  T„  is 

/  n  (x  )  =  £ - exp 

i  =  l  ain 

where, 

a r> 

am 

win  t-t  /  - 

II  '■am  ahn  ' 
h 

and 

am  =  <*m  /  n 

for  i  -  1,  •••  ,n  provided  ain  m  for  i  **  j  ■  We  will  only  consider  special  cases 
which  satisfy  this  condition. 

We  first  consider  the  standardized  mean  of  the  three  center  order  statistics  in  a  sample  of 
five  exponential  observations.  Figures  5.1 -5.3  show  the  exact  density  and  unsealed  saddlepoint 
approximation  compared  to  the  Edgeworth  approximation  which  includes  terms  up  to  order 
1/  n  (high  order  Edgeworth),  the  Edgeworth  approximation  which  includes  terms  up  to  order 
n  ~'/z  (low  order  Edgeworth),  and  the  normal  approximation.  These  figures  show  that  none  of 
the  approximations  perform  particularly  well  in  the  left  tail  of  the  density.  Both  of  the 
Edgeworth  approximations  even  become  negative  just  to  the  left  of  the  region  of  support  of 
the  exact  density.  Of  all  of  the  approximations,  the  high  order  Edgeworth  approximation  most 
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closely  follows  the  exact  density  on  the  region  of  positive  slope.  Neither  the  saddlepoint  or 
the  normal  approximations  can  ever  be  negative.  The  saddlepoint  approximation  outperforms 
the  normal  approximation  except  for  a  small  region  toward  the  center  of  the  density. 

In  the  right  half  of  the  density  both  of  the  Edgeworth  approximations  show  polynomial 
like  waves  with  the  low  order  Edgeworth  approximation  being  distinctly  bimodal.  The 
saddlepoint  approximation  follows  the  general  shape  of  the  exact  density  quite  closely  in  this 
half  of  the  density,  and  is  slightly  wide-tailed.  The  *+"  marks  under  the  right  tail  in  each  of 
the  figures  mark  the  .90,  .95,  .975.  .99,  and  .995  quantiles  of  the  exact  distribution. 

Figure  5.4  gives  a  close  up  of  the  density  and  the  approximations  in  the  10%  right  tail, 
and  Figure  5.5  plots  the  error  in  the  approximations  in  the  right  tail  for  the  standardized 
mean  of  the  center  three  order  statistics  in  a  sample  of  five  exponential  observations.  Figure 

5.6  shows  the  error  in  the  approximations  for  the  mean  of  the  center  9  order  statistics  in  a 
sample  of  11  exponential  observations.  Both  of  these  figures  show  the  same  general  behavior. 
In  both  sample  sizes,  the  saddlepoint  approximation  tends  to  be  fairly  stable  and  generally 
slightly  wide  throughout  the  tail  except  near  the  10%  point  in  sample  size  11.  Both  of  the 
Edgeworth  approximations  show  polynomial-like  w'aves.  The  low  order  Edgeworth  crosses 
the  exact  distribution  a  couple  of  times  in  the  tail  switching  from  being  too  wide  to  too  nar¬ 
row  and  back.  The  low  order  Edgeworth  approximation  performs  much  better  than  the  high 
order  Edgeworth  approximation  throughout  this  region  and  is  competitive  with  the 
saddlepoint  approximation  in  the  5%  tail.  It  is  sometimes  too  narrow  however.  In  the  5%  tail, 
the  error  in  the  normal  approximate  is  only  slightly  larger  in  absolute  value  than  the  error 
in  the  saddlepoint  approximation,  but  the  normal  approximation  is  uniformly  narrow. 

Figure  5.7  shows  the  exact  distribution  and  approximations  for  sample  size  5,  and  Figures 

5.7  and  5.8  show  the  error  from  the  exact  distribution  for  sample  sizes  5  and  11  in  the  10% 
left  tail.  As  in  the  case  of  the  density  approximations,  both  of  the  Edgeworth  approximations 
become  negative.  In  sample  size  5,  the  saddlepoint  approximation  performs  better  than  the 
others  throughout  the  tail  except  for  a  small  region  between  the  2.5%  and  5%  points  where 
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the  low  order  Edgeworth  approximation  crosses  the  exact  distribution.  For  sample  size  11.  the 
same  general  pattern  can  be  seen  in  Figure  5.9. 

Figures  5.10-5.12  plot  the  exact  and  approximate  tail  areas  for  the  right  10%  tail.  As  in 
the  density  case,  the  error  in  the  Edgeworth  approximations  shows  wavy  behavior  while  the 
saddlepoint  approximation  is  uniformly  wide.  The  low  order  Edgeworth  appears  to  be  the 
best  approximation  in  terms  of  absolute  error  in  the  7.5%  tail.  The  absolute  error  of  the  nor¬ 
mal  approximation  in  the  10%  tail  is  roughly  the  same  as  that  of  the  saddlepoint.  but  is  once 
again  it  is  uniformly  narrow  . 

6.  Discussion  and  Further  Research  Directions 

In  this  paper  we  have  presented  a  technique  for  converting  an  Edgeworth  approximation 
into  a  saddlepoint  approximation  and  have  applied  it  to  two  examples  of  L-statistics  for  which 
exact  results  are  available.  The  numerical  examples  considered  show'  that  this  saddlepoint 
approximation  is  in  general  competitive  with  the  Edgeworth  approximations.  In  the  first 
example,  the  saddlepoint  approximation  is  a  definite  improvement  In  the  second  example,  the 
results  are  mixed.  Nevertheless,  in  both  these  examples  the  saddlepoint  approximation  exhibits 
some  desirable  properties  which  the  Edgeworth  approximations  do  not.  First,  the  saddlepoint 
approximation  cannot  be  negative.  Second,  the  saddlepoint  approximation  is  unimodal  and  does 
not  show  the  polvnomial-like  waves  exhibited  by  the  Edgeworth  approximations.  Thus,  the 
error  in  the  saddlepoint  approximation  tends  to  be  locally  stable.  Finally,  the  saddlepoint 
approximation  tends  to  be  wide  in  the  tails  so  that  error  is  in  the  direction  of  giving  conserva¬ 
tive  tests  and  confidence  intervals.  As  with  the  saddlepoint  method  for  means  (Daniels.  1954). 
a  theoretical  advantage  of  this  method  is  that  the  leading  term  of  the  saddlepoint  approxima¬ 
tion  is  the  same  order  as  the  first  two  terms  of  the  Edgeworth  approximation. 

These  examples  have  other  interesting  features.  The  second  example  demonstrates  that  a 
higher  order  approximation  is  not  necessarily  better  in  spite  of  its  theoretical  appeal.  In  this 
example,  the  low  order  Edgeworth  approximation  is  far  superior  to  the  high  order  Edgeworth 
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approximation  in  the  right  tail  in  both  sample  sizes.  Both  examples  show  that  rescaling  does 
not  always  improve  the  quality  of  the  saddlepoint  approximation,  especially  if  interest  is  in 
the  tail  areas. 

Further  research  directions  include  application  of  these  techniques  to  robust  regression. 
Also,  these  techniques  can  be  made  nonparametric  by  replacing  the  underlying  distribution  by 
the  empirical  distribution  as  proposed  by  Field  (1984). 

The  problem  discussed  in  this  paper  was  originated  by  the  interest  in  the  distribution 
properties  of  the  so-called  broadened  letter  values  (bletter  values)  suggested  recently  by  J.W. 
Tukey  as  an  improvement  of  the  usual  letter  values  in  exploratory  data  analysis  (cf.  Tukey 
1977,  Mendoza.  1984).  Since  bletter  values  are  means  of  blocks  of  order  statistics,  the 
saddlepoint  approximations  derived  in  this  paper  can  be  carried  out. 
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